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(54) TWe: METHOD AND APPARATUS FOR DETECTING PREDETERMINED MOLECULAR STRUCTURES IN A SAMPLE 



(57) Abstract 

A predetermined molecular structure in a sample is detected by sensing a pattern in which the sample binds to an array of binding 
sites in a molecular detection device (20), and comparing the pattern to a reference pattern to detect the predetermined molecular structure 
in the sample (22). In one embodiment, the reference pattern U generated by sensing a pattern in which a reference sample containing 
the predetermined molecular s true fore binds to a like array of binding sites. In another embodiment, the reference pattern is generated by 
predicting a pattern in which the predetermined molecular structure binds to the array of binding sites. 
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5 METHOD AND APPARATUS FOR DETECTING PREDETERMINED 

MOLECULAR STRUCTURES IN A SAMPLE 

Field of the Invention 

10 The present invention relates to methods and 

system for molecular detection. 

Background of the Invention 

15 An increased effort has been directed toward 

the development of chips for molecular detection. 
Typically/ a molecular detection chip includes a 
substrate on which an array of binding sites is 
arranged. Each binding site, or hybridization 

20 site, has a respective molecular receptor which 

binds or hybridizes with a molecule having a 
predetermined structure. 

A sample solution is applied to the molecular 
detection chip, and molecules in the sample bind or 

25 hybridize at one or more of the binding sites. The 

particular binding sites at which hybridization 
occurs are detected, and one or more molecular 
structures within the sample are subsequently 
deduced. 

30 Of great interest are molecular detection 

chips for gene sequencing. These chips, often 
referred to as DNA chips, utilize an array of 
selective binding sites each having respective 
single-stranded DNA probes. A sample of single- 

35 stranded DNA fragments, referred to as target DNA, 

is applied to the DNA chip. The DNA fragments 
attach to one or more of the DNA probes by a 
hybridization process. By detecting which DNA 
probes have a DNA fragment hybridized thereto, a 



FIG. 6 illustrates a reference pattern for 
detecting an a-c-t nucleotide sequence in a sample; 

FIG. 7 is an example of a pattern generated by 
a sample having an unknown molecular structure; 

FIG. 8 is a flow chart of additional "steps 
which can be utilized to detect the predetermined 
molecular structure; 

FIG. 9 is a block diagram of an apparatus for 
detecting a predetermined molecular structure in a 
sample; and 

FIG. 10 is a flow chart of an embodiment of a 
method of gene discovery in accordance with the 
present invention . 

Detailed Description of a Preferred Embodiment 

Embodiments of the present invention 
advantageously provide improved information 
processing approaches to detecting predetermined 
molecular structures using a miniaturized device 
having an array of biological sensors. Just as 
semiconductor devices are designed to perform 
specific functions, a diagnostic device in 
accordance with the present invention is designed 
to perform one or more specific diagnostic tests. 

FIG. 1 is a flow chart of an embodiment of a 
method of detecting a predetermined molecular 
structure in a sample. In general, the method can 
be utilized for detection of a variety of molecular 
structures in a variety of different types of 
samples. Examples of the different types of 
samples include, but are not limited to, medical 
samples, environmental samples, agricultural 
samples, and other samples applicable tc 
diagnostics . 
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5 hybridizes with a molecule having a predetermined 

structure . 

Each molecular receptor typically includes a 
biological or synthetic molecule having a specific 
affinity to the molecule to be detected. Of 

10 particular interest is a molecular receptor having 

a chain of at least one nucleotide to hybridize 
with a molecule having a complementary chain of at 
least one nucleotide. Here, for example, the 
molecular receptor can include a DNA probe for 

15 detecting a corresponding, complementary DNA 

sequence in the sample. 

It is noted, however, that the scope of the 
invention is not limited to sensing the 
hybridization of DNA molecules. For example, 

20 embodiments of the present invention can be 

utilized to detect RNA hybridization and antibody- 
antigen binding events. As another alternative, 
the molecular detection device can include an array 
of detection sites, such as in the context of an 

25 oligonucleotide ligation assay (OLA) . Using a 

ligase chain reaction, pairs of oligonucleotides 
are utilized to amplify a selected oligonucleotide 
sequence. To detect the selected oligonucleotide 
sequence, a corresponding detection site is 

30 screened for full-length ligated oligonucleotides 

using any of the sensing approaches described 
herein. 

As indicated by block 12, an optional step of 
tagging molecules within the sample is performed. 
35 Each molecule is tagged with a member which can be 

sensed by the molecular detection device. Such 
members are commonly referred to in the art as 
• tags, markers, and labels. Examples of such 
members include, but are not limited to, 



After hybridization, an optional step of 
removing unwanted molecules from the binding sites 
can be performed, as indicated by block 18. The 
step of removing unwanted molecules can be 
performed by generating an electric field "having 
the same polarity as the charge of the unwanted 
molecules. The electric field acts to repel 
unwanted molecules from the binding sites. As an 
alternative to, or in conjunction with, the field- 
based approach, a thermally-assisted approach can 
be utilized to remove unwanted molecules. Here, 
the temperature at the binding sites is raised, in 
dependence upon a melting temperature, to 
dissociate partially-bound molecules from the 
molecular receptors. Regardless of the approach 
utilized, the unwanted molecules to be dehybridized 
can include unbound molecules and partially-bound 
(i.e. non-specif ically bound) molecules. 

Typically, the step of removing unwanted 
molecules does not remove all unwanted molecules 
from the binding sites. This step is beneficial, 
however, in improving the accuracy of detection as 
outlined in subsequent steps. 

As indicated by block 20, the method includes 
a step of sensing a pattern in which the sample 
binds to an array of binding sites in a molecular 
detection device. The pattern can be sensed using 
a variety of approaches, including but not limited 
to, optical approaches, radioactive-sensing 
approaches, electronic approaches, and magnetic 
approaches. The specific approach utilized depends 
upon the type of tagging member attached to the 
molecules in the sample. 

Preferably, the step of sensing the pattern 
includes sensing an intensity or a magnitude of 



determining a difference between the pattern and 
the reference pattern. Here, the predetermined 
molecular structure can be detected when a measure 
of the difference is within a predetermined range. 

For optical sensing embodiments, the step of 
comparing includes a step of comparing at least one 
image of the pattern to an image of the reference 
pattern. 

As indicated by block 24, the method 
optionally comprises a step of determining a 
confidence level of detecting the predetermined 
molecular structure in the sample. The confidence 
level indicates a degree of significance of the 
result obtained in the step of comparing in block 
22. 

To screen the sample for a plurality of 
different molecular structures, the steps indicated 
by blocks 22 and 24 can be repeated for a plurality 
of different reference patterns. Here, for 
example, a genomic sample can be screened to 
determine if it contains any of a plurality of 
predetermined base sequences. 

FIG. 2 is a flow chart of an embodiment of a 
method of generating the reference pattern. 
Typically, the reference pattern is generated prior 
to performing the steps indicated in FIG. 1. 

As indicated by block 30, the method includes 
a step of providing a reference device having a 
like array of binding sites as the molecular 
detection device used for detection. If desired, 
the same molecular detection device can be utilized 
for generating the reference pattern and for 
subsequent detection of an unknown molecular 
structure in a sample. Typically, however, another 
like device is utilized. 



11 

molecular structure. Alternatively, the steps 
indicated by blocks 30, 32, 34, 36, 38, 40, and 42 
can be repeated to apply the same predetermined 
molecular structure to a number of like reference 
devices. Either approach may be utilized to 
provide a plurality of reference patterns for the 
same predetermined molecular structure. 

Sensed patterns formed by a sample having an 
unknown molecular structure can be compared to each 
of the above-described plurality of reference 
patterns, or a statistical model thereof, to detect 
the predetermined molecular structure in the 
sample . 

FIG. 3 is a flow chart of another embodiment 
of a method of generating the reference pattern. 
As- indicated by block 50, the method includes a 
step of determining an architecture of the array of 
binding sites of the molecular detection device. 
This step can include determining a layout of the 
binding sites, and determining the type of 
molecular receptor at each of the binding sites. 

As indicated by block 52, the method includes 
a step of predicting a reference pattern in which 
the predetermined molecular structure binds to the 
array of binding sites. The reference pattern is 
predicted based upon the predetermined molecular 
structure and the architecture of the molecular 
detection device. Preferably, the reference 
pattern includes a predicted intensity of binding 
at each of the binding sites. 

Regardless of the approach taken, the 
reference pattern acts as a novelty filter which is 
predictive of a successful or a desirable test 
result . 
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5 as binding sites indicated by reference numeral 64, 

have a lesser intensity of binding. Those binding 
sites having two mismatching complementary bases, 
such as those indicated by reference numeral 66, 
have an even lesser intensity of binding. ' 3inding 

10 sites with no matching complementary bases, such as 

those indicated by reference numeral 68, have a low 
intensity of binding. 

FIG. 6 illustrates a reference pattern for 
detecting an a-c-t nucleotide sequence in a sample. 

15 The reference patterns in FIGS. 5 and 6 can be 

sensed using a reference sample, or can be 
predicted based on the number of mismatching bases 
at each binding site. 

For purposes of illustration, the sequences in 

20 this example are assumed to have a specific 

orientation. As a result, an a-c-t sequence and a 
t-c-a sequence do not specifically hybridize at the 
same binding site. It is noted, however, that this 
should not be construed as a limitation in the 

25 scope of the present invention. 

FIG. 7 is an example of a pattern generated by 
a sample having an unknown molecular structure. 
The pattern is generated by applying a sample of 
tagged single-stranded DNA molecules to the 

30 molecular detection device, allowing the molecules 

to hybridize to the binding sites, and optionally 
removing unwanted molecules. 

The resulting pattern shows a high intensity 
of binding at a t-g-a site "70. If standard 

35 detection techniques were utilized, one would 

conclude that the sample includes an a-c-t 
nucleotide sequence (i.e. the complement of t-g-a). 
However, the overall pattern is better correlated 
to the reference pattern for the a-c-g nucleotide 
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5 selected binding sites, or can be modified for all 

of the binding sites. 

Thereafter, a step of sensing a pattern, 
indicated by block 82, is performed- By repeatedly 
raising the temperature and sensing a resulting 

10 pattern, a plurality of temperature-dependent 

patterns is generated. 

As indicated by block 84, a step of comparing 
at least one of the plurality of temperature- 
dependent patterns to a corresponding at least one 

15 of a plurality of temperature-dependent reference 

patterns is performed. Here, for example, each of 
the temperature-dependent patterns can be compared 
to a corresponding one of the temperature-dependent 
reference patterns. Alternatively, only selected 

20 ones of the temperature-dependent patterns can be 

compared to corresponding reference patterns. A 
correlation measure and/or a difference measure is 
computed based on this comparison. A predetermined 
molecular structure is detected when the measure is 

25 within a predetermined range. 

In one embodiment, the temperature-dependent 
pattern having a greatest variability of intensity 
is selected for comparison. The variability of 
intensity is greatest at a temperature which 

30 dissociates many non-specifically-bound molecules, 

but does not significantly dissociate specifically- 
bound molecules. This pattern can be compared with 
a corresponding reference pattern to detect a 
predetermined molecular structure. It is noted 

35 that a variety of different measures of variability 

can be utilized, including but not limited to, 
sample variance and sample standard deviation. 

FIG. 9 is a block diagram of an apparatus for 
detecting a predetermined molecular structure in a 
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5 present invention. As indicated by block 100, the 

method includes a step of sensing a pattern of 
detection for a sample applied to a molecular 
detection device having a plurality of detection 
sites. The sample is taken from a first species 

10 having unknown genes to be discovered. Any of the 

various approaches described herein can be utilized 
for sensing the pattern. 

As indicated by block 12, the method includes 
a step of determining an architecture of the 

15 plurality of detection sites of the molecular 

detection device. 

A step of reading a nucleotide sequence from a 
database is performed, as indicated by block 104. 
In general, any nucleotide sequence can be read. 

20 Of particular interest, however, is a nucleotide 

sequence from a second species other than the first 
species. For example, the nucleotide sequence can 
include a gene from a fruit fly, -while the sample 
in which gene discovery is to be performed is from 

25 a human. 

As indicated by block 106, the method includes 
a step of predicting a reference pattern which 
would be detected if the nucleotide sequence were 
applied to the molecular detection device. As 

30 indicated by block 108, a step of comparing the 

pattern to the reference pattern is performed to 
determine whether the nucleotide sequence is within 
the sample. 

The steps of reading a nucleotide sequence 

35 from the database, predicting a reference pattern 

for the nucleotide sequence, and comparing the 
pattern to the reference pattern are repeated to 
discover the presence of genes across different 
species . 
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5 Claims 

1. A method of detecting a predetermined 
molecular structure in a sample, the method 
comprising the steps of: 

10 sensing a pattern in which the sample binds to 

an array of binding sites in a molecular detection 
device; and 

comparing the pattern to a reference pattern 
to detect the predetermined molecular structure in 
15 the sample. 

2. The method of claim 1 further comprising 
the step of determining a confidence level of 
detecting the predetermined molecular structure in 

20 the" sample. 

3. The method of claim 1 wherein the step of 
comparing includes at least one of determining a 
difference between the pattern and the reference 

25 pattern, determining a correlation between the 

pattern and the reference pattern, comparing at 
least one image of the pattern to an image of the 
reference pattern, comparing a plurality of 
temperature-dependent patterns to a plurality of 

30 temperature-dependent reference patterns. 

4. The method of claim 1 wherein an intensity 
level of at least one of the binding sites is 
indicative of at least one of molecules bound at a 

35 respective binding site, a binding strength at a 

respective binding site, and a melting temperature 
at a respective binding site. 
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5 array of binding sites or by predicting a pattern 

in which the predetermined molecular structure 
binds to the array of binding sites. 



WO 98/08083 



PCT/US97/14372 ~> 



2/5 



( START ) 



JO 



PROVIDE A REFERENCE DEVICE HAVING A LIKE ARRAY OF BINDING SITES | 



32 



TAG MOLECULES IN A REFERENCE SAMPLE 



34 



APPLY REFERENCE SAMPLE TO REFERENCE DEVICE 



36 



ALLOW REFERENCE SAMPLE TO HYBRIDIZE 



38 



REMOVE UNWANTED MOLECULES FROM THE BINDING SITES 



40 



SENSE A PATTERN IN WHICH THE REFERENCE SAMPLE BINDS TO 
THE BINDING SITES 



42 



MODIFY TEMPERATURE 



STOP 

FIG. 2 

( START ^ 



^1 



50 



DETERMINE ARCHITECTURE OF THE ARRAY OF BINDING SITES OF THE 
MOLECULAR DETECTION DEVICE 



i 

PREOICT A PATTERN IN WHICH THE PREDETERMINED MOLECULAR 
STRUCTURE BINDS TO THE ARRAY OF BINDING SITES 




FIG.3 



til 



WO 98AJ8083 



PCT/US97/14372 -» 



4/5 



^aaaMacWaca^ 

v///ym&M////. 



mm 







FIQ.6 




ul 



INTERNATIONAL SEARCH REPORT 


Intern a lioeal application No. • 
PCT/US97/14372 


A. CLASSIFICATION OF SUBJECT MATTER 




IPC(6) :Please See Extra Sheet 




US CL .Please See Extra Sheet. 


and IPC 


AccordinR to International Patent Classification (IPC) or to both oaboosl classification 


a FIELDS SEARCHED 



documentation searched (classification system folk) wed by classification symbols) 



U.S. : 204/153.1. 400, -403; 435/6. 283.1, 2S7.1, 287-2; 43673 IS. 324, 807. W9 
Documentation searched other tbao minimum doouncmlabofl to the extent that web doc urn rati arc included in the fields searched 



Electronic data base consulted during the international aearch (owne of data base and. where practicable, search terms used) 
APS. CAPLUS. MEDLINE. WPIDS. SCISEARCH. MEDLINE. EMBASE. BtOSIS 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant so claim No. 


X 


US 5,532,128 A (EGGERS et al) 02 July 19%. See entire 
document. 


1-10 


X 


PEASE AG. Light-generated Oligonucleotide Arrays for Rapid 
DNA Sequence Analysis. Proc. Natl. Acad. Sci. May 1994. Vol 
91. pages 5022-5026, especially Abstract. 


1-10 


X 


STIMPSON DL Real-time Detection of DNA Hybridization and 
Melting On Oligonucleotide Arrays by using Optical Wave Guides. 
Proc. Natl. Acad. Sci. July 1995. Vol 92. pages 6379-6383, 
especially Abstract 


1-10 


A,P 


US 5,605,662 A (HELLER et al) 25 February 1997. 


1-10 



|"x| Fotther documents an listed is the continuation of Box C. Q See patent family annex 



SpmtJ Mtoforw of oitod ik \m m wm W 
lMn|a«|<Militib»r* 



to U of 



Mm oocuaanl p w b h» bt< kIW ftw totmttkxwl filing <**to or priority 
bat* >»d art io radii* wilfc tb* *pptk»uoo but ritod to uukntud 
ttx* praMpb of tb*cry rad*r lying tb* btmmim 



■ art publi»fe*4 <m or *JW to* « 



dew wH vakb may throw oovbo priority 
tated* to wtobbia Ow niiMirort— dot* of a*«4 
f*oi*l mm <« yiek W) 

dotMMt r*f*rrini to as *nl dlitUn, a**, 
do j— *nlpwM»^»d prior to t 



Mtioot! filk* <tota 
•tom(t) or wbioo » 



docuB«*rt of porttouUr ntwipci: iH* cUancd mycotica canoot b« 
bfarW BOTtt or cvac4 b* coasioWvd to bivohr* «o io? **trr* Mop 
wboa 4>* doowool » b**> kfeo* 



oo«— ,«rt of porticultf ntniDM; tb* e»im*4 mroobo* cun«t 
in— iiWirt to ktrolv* *e invoDtrt* M*p wba tb* ooctmoDt a 
oam ao>4 wis* om or «or* ooior wd QocumontK, awch ooabtomapa 
bana ofcrtow to • p*»on »fciU*4 is tb* «rt 

Jinyiml RMto of tb* Mm* ptlcot f unity 



-~ » * 

Date of the actaa) conpfetion of the international search 

24 NOVEMBER 1997 


Date of nailing of^h^uitcniational search report 

/flZ KC 199J /^T 


Name and mailing address of the ISAAJS 
CoanxtissioaeT of Paknts aed Trademarks 

BoxPCT 

Washington. D.C 30231 / 
Facsimile No. (703) 3C5-3230 / / 


A»thoni& ifljeer - ^ X I 1 



INTERNATIONAL SEARCH REPORT 



bternitiootl application No. 
PCT/US97/14372 



A. CLASSIFICATION OF SUBJECT MATTER: 

ipc m 

OOJN 27AX), 27/2$. 33/343, 33/351; CJ2Q 1/68; A01N 1/02; CI2M 1*0; 

A. CLASSIFICATION OF SUBJECT MATTER: 
US CL : 

204/153.1. 400, 403; 435/6. 283.1. 287.1. 287.2; 436/518. 524, 807. 809 



Form PCT/ISA/210 (extra ahectXJuly 1992)* 



3ft 



